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Abstract 



a-helices stand out as common and relatively invariant secondary structural 
elements of proteins. However, a-helices are not rigid bodies and their defor- 
mations can be significant in protein function {e.g. coiled coils). To quantify 
the flexibility of a-helices we have performed a structural principal-component 
analysis of helices of different lengths from a representative set of protein folds 
in the Protein Data Bank. We find three dominant modes of flexibility: two 
degenerate bend modes and one twist mode. The data are consistent with 
independent Gaussian distributions for each mode. The mode eigenvalues, 
which measure flexibility, follow simple scaling forms as a function of helix 
length. The dominant bend and twist modes and their harmonics are re- 
produced by a simple spring model, which incorporates hydrogen-bonding 
and excluded volume. As an application, we examine the amount of bend 
and twist in helices making up several coiled-coil proteins. Incorporation of 
a-helix flexibility into structure refinement and design is discussed. 

I. INTRODUCTION 

Protein folds typically consist of two fundamental building blocks: a-helices and (3- 
strands. These secondary elements pack together to form the final tertiary foldit However, 
the constraints of packing may be inconsistent with idealized conformations of the helices 
and strands. To what extent are these elements flexible? 

One measure of flexibility is provided by a Ramachandran plot of the probability distri- 
bution of backbone dihedral anglesifl. In such a plot, a-helices appear as a high-probability 
peak around <fi = —50, ip = —50, while /5-strands form a more diffuse peak around cb = —120, 
ib = 120. However, the flexibility of helices and strands is due to collective motion of many 
residues, and cannot be adequately described by the distribution of single {0, ip} pairs. 

Collective deformations have been considered before in many biological contexts. Normal 
mode analysis of protein structure has been performed to extract the flexible modes of 
proteinsa'™QB&ty. The flexible motions identified in this way sometimes correspond to 
functional motions that the protein can performlll. The flexibility of double-stranded DNA 
plays an important role both in the packing of DNA0, and in regulation of gene transcription 
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via protein-DNA interactions@00. Recently, a principal-component analysis of database 
DNA structures was used to characterize the average deformation and deformability of all 
dinucleotide pairslll. 

We have employed a similar principal-component analysis (PCA) to quantify the flexibil- 
ity of a-helices. Helices with lengths ranging from L = 10 — 25 residues were extracted from 
a representative set of a, a + (3, and a/ (3 folds from the Protein Data Bank. We found that 
there are three dominant modes of flexibility: two nearly degenerate bend modes and one 
twist mode. It is natural to identify these as the three lowest normal modes of an a-helix, 
in particular since the distribution of amplitudes is consistent with independent Gaussians. 
According to elasticity theory, these low-lying normal modes should be insensitive to details 
of the interaction potential. Indeed, we found that a spring model with only two parameters 
reproduced not only the main bend and twist modes, but many higher-order modes as well. 

What relevance does a-helix flexibility have to biology? An obvious case is the forma- 
tion of coiled coils of a-helices. A coiled coil is a domain of two or more a-helices wound 
around each other to form a superhelix. The a-helices typically interact with each other via 
buried hydrophobic residues, salt bridges, and interlocked sidechainJiE. Such superhelical 
domains often contribute to protein-protein recognition, with helices from different proteins 
coming together to form the coiled coil. We have examined helices making up several dif- 
ferent coiled coils: the leucine- zipper, the tetramerization domain of the repressor Mnt, and 
chicken fibrinogen. For the leucine-zipper and the Mnt coiled coil (which are composed 
of relatively short coiled-coil helices) we find that the deformations can be accounted for 
primarily by the bend and twist modes. For fibrinogen, which is composed of much longer 
helices, higher order harmonics of bend are required to describe the deformation. In all the 
cases examined, helices making up coiled coils can be well described using a minimal number 
of the lowest normal-modes of our spring model. 

The quantification of a-helix flexibility may prove useful in structure refinement and 
protein design. For instance, folding studies that rely on rigid helical fragments may benefit 
from the inclusion of collective flexible motions to further optimize the energy of the sequence 
for the given fold@lll. A recent approach to protein design considers all possible packings 
of secondary structural elementsH2l; however, so far only idealized rigid helices have been 
considered. Based on the current results, the low-lying bend and twist modes of helices can 
be incorporated to allow a more realistic balance between packing and deformation energies. 



II. RESULTS 



A. Principal-component analysis of database helix structures 

Sets of a-helices of given length were extracted from a representative set of protein struc- 
tures (see Methods). To quantify the flexibility of these helices, we performed a structural 
principal-component analysis (PCA). For each length L, i.e. number of residues, we began 
by computing the mean helix structure via an iterative procedure. Starting with an ideal 
helix (periodicity 3.6 residues/turn, rise 1.5 A/residue), we aligned the C a positions of each 
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length- L fragment in the representative set to the ideal helixd. A mean structure was then 
obtained by averaging the position of each C a over these aligned structures. This procedure 
was then iterated, each time using the new, mean structure as the basis for the alignments, 
until the mean structure converged to within 10 -4 A/residue. An example of a set of aligned 
structures from the representative data set is shown in Fig. |V| for helices of length L = 18. 

The second step in the principal-component analysis was to compute the structural 
covariance matrix for each length L. The covariance matrix is a measure of correlations 
between coordinates. In our case, it is a square matrix of dimension 3L (three spatial 
dimensions for each of L C a atoms), with element i,j defined as 

1 N 

Q,j = IT _ { x mi ~ (Xi))(x m j — (Xj)), (1) 

iv 1 m =l 

where N is the number of helices of length L in the data set, x m i is the i th coordinate of the 
m th structure, and (xj) is the i th coordinate of the mean structure. 

We then computed the eigenvalues, {X q }, and eigenvectors, {v q }, of the covariance ma- 
trix. The largest eigenvalues and corresponding eigenvectors represent the directions in the 
3L dimensional space for which the data has the largest variance. These directions are the 
"soft modes" of the helices, i. e. those collective deformations which appear with largest am- 
plitude in the data set. Fig. 0(a) shows the top 10 eigenvalues for helices of length L = 18. 
Each eigenvalue is given in units of A 2 and measures the variance of the distribution for a 
particular mode. Three dominant eigenvalues are evident in Fig. 0(a). The first two modes 
are nearly degenerate and correspond to the bending of the helix in two orthogonal planes. 
The third mode is the overall twist of the helix. These modes are shown with exaggerated 
amplitudes in Fig. [V|. 

The scaling of the eigenvalues (i.e. variances) of the first three modes as functions of 
helix length is shown in Fig. 0(b) for helices ranging from 10 to 25 residues. The eigenvalues 
of the bend modes grow with helix length approximately as L 4 while the eigenvalue for twist 
grows approximately as L 2 . This difference occurs because bend modes induce displacements 
from the mean helix structure which grow quadratically with helix length, while twist modes 
induce displacements which grow linearly with helix length. A model for the scaling of the 
eigenvalues based on an elastic rod is presented later in the paper. 

Next, we look at the actual distribution of the data for the three dominant modes. For 
each helix fragment its displacement vector 5x = x — (x) can be expanded in terms of 
the eigenvectors giving 5x = J2q a qV q . The amplitude a q is given by the projection of the 
displacement vector onto mode q. Figure |V](a,b,c) shows histograms of the projections onto 
the two bend and one twist mode for helices of length L = 18. For both the two bend modes 
and the one twist mode, the data has a nearly ideal Gaussian distribution. Best x 2 -^s to 
Gaussians are shown by solid lines. By definition, the exact variance of each distribution is 
equal to the eigenvalue X q for that mode. The variances of the best-fit Gaussians, 1.55 A 2 , 
1.53A 2 , and 0.7lA 2 , for the two bend and one twist mode, respectively, agree well with the 



*Each structure was aligned so that the coordinate root mean square (crms) distance between it 
and the mean structure was minimized. 
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exact variances, 1.53A 2 , 1.5lA 2 , and O.66A 2 .0 

By construction, the modes derived from the principal-component analysis are uncorre- 
lated to lowest order. That is, the expectations (a q a q >) are all zero, where a q and a q i are 
amplitudes of different modes for a single helix. (This is simply the statement that the co- 
variance matrix is diagonal in the basis of the eigenmodes.) However, there is no guarantee 
that the modes are uncorrelated at higher order. To look for possible correlations, we made 
scatter plots of the amplitudes of the three dominant modes in all pairwise combinations, 
shown in Fig. |V|(d,e,f) for the 1182 helices of length L = 18. The distributions of points in 
all three scatter plots are roughly ellipsoids with axes along x and y, indicating that there 
are no strong higher-order correlations between modes. This type of behavior was seen for 
all of the helix lengths L = 10 — 25. 



B. Normal-mode analysis of spring model for a-helices 

While the dominant modes derived above come from studies of static structures, their 
properties are suggestive of dynamical normal modes. We show below that the two bend 
modes and one twist mode can be obtained from a simple model for the dynamics of a 
free a-helix. Moreover, the eigenvalue scaling and the uncorrelated Gaussian form of the 
distributions are characteristics of modes in thermodynamic equilibrium. 

From an energetic point of view, an a-helix retains its helical shape due to two primary 
interactions. The first is the backbone hydrogen-bonding interaction between residues i and 
i + 4. The second is the excluded volume interaction between backbone atoms. We model 
these two terms by springs connecting nearby C a atoms of an ideal helix. Again, we take 
an ideal helix to have periodicity 3.6 residues/turn and rise 1.5 A/residue. The potential 
energy for the spring-model of the helix is given by 

V = Y. E \K m {\n-r l+m \-dl l+m f (2) 

i m=l, 2,3,4 / 

where r*j is the position of the i th C a atom, and d®j is the equilibrium distance between the 
i th and j th C a atoms. In Eq. (0), there are springs connecting pairs of residues up to four 
apart along the chain, and so there are four spring constants K m for m = 1,2, 3, 4. However, 
we consider the limit K\ — > oo which holds nearest-neighbor C a atoms at a fixed distance 
of 3.8 A, and we set K 2 ^ = K 2 = K 3 , leaving only two spring constants, K 2 ^ and K A) as 
parameters. 

The normal modes of the model a-helix are obtained by diagonalizing the 3L x 3L spring 
matrix 

d 2 V 

^ J dxidxi ' ^ 



^The fitted values of the variances depend on the binning of the data. The shown binning yielded 
results that agree best with the exact variances of the distributions. 
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where Xi is the i th member of the 3L coordinates describing the helix. The eigenvalues K q 
determined from the normal-mode analysis represent effective spring constants for each of 
the normal modes. The matrix has six zero eigenvalues, corresponding to the six rigid- 
body rotations and translations, for which there are no return forces. The lowest non-zero 
eigenvalues are the lowest-energy normal modes of the helix. Over a broad range of values 
for the spring constants, the first two non-zero modes are bend modes and the third is twist 
for helices up to length 33 (beyond this length, higher order harmonics of bend occur before 
twist), consistent with the dominant modes of static helices found from the PCA. Typically 
for the lengths studied, the top 7 — 10 modes from the normal-mode analysis agree very well 
with those obtained from PCA. At thermal equilibrium, these dynamical modes would follow 
the Boltzmann distribution P(a q ) ~ exp(—K q a 2 /2k B T) where P(a q ) is the probability of 
observing the q th mode with amplitude a q and \K q a 2 is the potential energy of the mode. 

A more detailed comparison between the PCA and the spring model can be made by 
conjecturing that the Gaussian distributions of PCA modes represent equilibrium Boltzmann 
distributions at some effective temperature T*. (Below, we discuss the use of T* rather than 
room temperature T.) With this conjecture, one has the relation 



a 2 . , K^al 



exp( f ) = exp( q - i), (4) 

PV 2X q PV 2k B T* h v ; 

where the a q are the mode amplitudes. So the effective spring constants of the PCA modes 
are given by K^ PCA ^ = k B T* /X q . In other words, the PCA eigenvalues \ q can be interpreted 
as inverse spring constants, with a proportionality constant k B T*, i.e. X q = k B T* / K g PCA \ 
Figure 0(a) shows a plot of the eigenvalues X q for the first three PCA modes, compared 
with k B T* I K q using the spring constants K q for the first three low-energy modes determined 
from the normal-mode analysis of the spring model. The real-space spring constants that 
give this fit are K 2 ^ = 20 k B T*/A 2 and K 4 = 7 k B T*/A 2 . The agreement between the 
PCA modes and the normal modes of the spring model is striking for both eigenvalues and 
eigenvectors (Fig. |V](b)). Note that there are only two free parameters in the spring model 
i^2,3 and K4, and that the mode shapes depend only on their ratio. Thus the dominant 
modes of static a-helices extracted from the database can be identified with the normal 
modes of simple spring model for a helix. 



C. Scaling of the PCA modes 

Guided by the interpretation of the PCA modes as normal modes, the scaling of the PCA 
eigenvalues can be understood relatively simply in terms of the bending and twisting of an 
elastic rod. For a uniformly bent rod, the displacement away from vertical goes as Sx ~ l 2 /R, 
where I is the length along the rod, and R is the radius of curvature. The normalized eigen- 
vector describing this bending mode has the form v ~ (R 2 / X 5 ) 1 / 2 (L 2 / R, . . . , L 2 /R). Within 
a principal-component analysis, the eigenvalue for this bend mode is the average square of 
the projection of the displacement of the rod onto this mode. So the bend eigenvalue is 
given by 

Xbend = (\SX ■ V\ 2 ) ~ ( 



L A R 



R 2 L 5 /2 



>=*v 



(5) 
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At thermal equilibrium each normal mode has ksT/2 of potential energy. For the bend 
mode, this energy is put into the curvature of the rod, 

5*-r = i«t<^>. (6) 

Substituting this equilibrium relation for (l/R 2 ) into Eq. @ for the bend eigenvalue gives 

Abend ~ -^ 5 (^) = — ^ 

Thus from thermodynamic arguments, we find that the principal-component eigenvalue of 
the bend mode of an elastic rod scales as L 4 , as was found in Fig. (|V|) for the bend eigenvalue 
of a-helices. 

For twist, we assume that the rod twists uniformly by an angle 59 per unit length. 
The displacement associated with twist along the rod is given by 5x ~ 156, and hence the 
normalized vector describing this mode is v ~ 1/ (59 2 L 3 ) 1 / 2 (— L59 , . . . , L59). Using the same 
formulation as above, we find that the eigenvalue for the twist mode goes as, 



Kwist ~ ( 



(56 2 L 



2t2\ 2 



) = (59 2 )L 3 . (8) 



(56' 2 L 3 ) 1 / 

At thermodynamic equilibrium, the energy associated with the twist mode is 

l -k B T= C -(56 2 )L (9) 

where c is a spring constant associated with twist. Substituting this equilibrium result for 
(59 2 ) into Eq. flD for the twist eigenvalue gives 

\ tms t ~ (56 2 }L 3 = k -^L 2 . (10) 

So, we find that the principal-component eigenvalue of the twist mode of an elastic rod scales 
as L 2 , as was found in Fig. (|V|) for the twist eigenvalue of a-helices. 

Thus the eigenvalues of bend and twist extracted from the PCA of a- helices are seen 
to scale with length in the same manner as those of a fluctuating elastic rod at thermal 
equilibrium. The difference between the scaling exponents, L 4 for bend and L 2 for twist, can 
be traced to the length dependence of displacements. For bend modes, displacements grow 
quadratically with length, 5x ~ l 2 /R, while for twist modes displacements grow linearly, 
5x ~ 156. 



D. Application to helices forming coiled-coils 

In this section we examine to what degree helices making up coiled-coils can be described 
using the lowest-energy normal modes. We have chosen three representative coiled-coil 
structures. The first is a leucine zipper (2ZTA), which consists of two interacting helices. 
The second is the tetramerization domain of Mnt repressor which is representative of coiled- 
coils that form as a result of protein-protein interactions (1QEY). Lastly, we consider two 
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long helices that form a part of a coiled-coil in the structural protein fibrinogen from chicken 
(1M1J). The coiled coils that we have chosen to analyze are shown in Fig. [V[ 

For each coiled-coil helix we computed the normal modes of an ideal helix of identical 
length using our spring model. We then aligned the coiled-coil helix to the ideal helix and 
computed the displacement vector 5x, which by definition can be expanded in terms of the 
spring-model normal-mode eigenvectors 5x = J2 q dqVq- We then projected Sx onto each 
eigenvector yielding projection amplitudes a q . The percentage of the displacement vector 
due to a single mode q is given by a 2 ,/ a 2 ,. 111 Table |, we show the percentages of the 
coiled-coil helix displacements captured by the sum of both bend modes, Bend, the 2nd and 
3rd harmonics of bend, Bend^ 2 -* and Bend^ and lastly Twist. For the shorter helices in the 
leucine zipper and Mnt, we find that the helical displacements are described predominantly 
by the bend modes, with some twist. Thus coiled coils that are formed by shorter helices can 
be described well using just the bend and twist modes of the spring model. For the larger 
helices making up fibrinogen, where there is clear evidence of supercoiling, the 2nd harmonic 
of bend is required. The 2nd helix of the fibrinogen coil (green helix in Fig. 0c) has 68% 
of its displacement captured by the two 2nd harmonics of bend. Thus helical supercoiling 
is captured by higher-order harmonics of the fundamental bend mode. (For the helices of 
length 78, the 2nd and 3rd harmonics of bend are lower in energy than the twist mode - so 
twist is no longer the third lowest normal mode for longer helices.) 

III. DISCUSSION 

A. Connection between static and dynamical modes of helices 

Our principal-component analysis has shown that a-helices of lengths up to 25 residues, 
have three dominant independent "soft modes": two bend and one twist. These modes were 
determined from static a-helix structures in the Protein Data Bank. The principal modes 
determined from these static snap-shots agree extremely well with the dynamical normal 
modes of an a-helix obtained from a simple spring model. The projections of the static a- 
helices onto these three principal modes yield Gaussian distributions, which coincides with 
the distribution expected for dynamical equilibrium fluctuations. Why should an ensemble 
of static a-helical structures be related to the normal-mode fluctuations of a helix at thermal 
equilibrium? This connection can be understood if the ensemble of static a-helical structures 
has been sampled from a system that is under the influence of random forces. In a given 
protein structure, helices adopt conformations so that the forces acting on them add to zero. 
Over the entire ensemble of protein folds it is reasonable to assume that the forces that 
an a-helix experiences are approximately random. An elastic objected acted on by random 
external forces is equivalent to that same object at thermal equilibrium at some effective 
temperature T* . The fluctuations in the energy of the ensemble of a- helical structures set 
the effective temperature T*. (Since the forces, or more precisely, energies involved in protein 
folding (hydrophobic interactions, hydrogen bonding, Van der Waals etc.) all have the scale 
of the order of kcal/mol, or a few ksT, with T being the room temperature, the effective 
temperature T* obtained from PCA should be of the order of room temperature. Indeed, 
our fitted value of = 7/cbT*/A 2 , the hydrogen bond spring constant, is quite consitent 
with the hydrogen bond energy with T* being room temperature.) The distribution of static 
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helical structures sampled from a large ensemble of proteins will therefore have a distribution 
that is equivalent to a helix at thermal equilibrium at some temperature T*. If the forces 
that helices experience within protein structures were systematically non-random then the 
resulting PC A distributions would depart from those of a helix at thermal equilibrium. 



B. Incorporating helix bend and twist into models of protein structure 

The results presented here can potentially be applied to structure refinement and struc- 
ture design. Most off-lattice structure models of proteins fall into two classes: those with 
rigid secondary element sEHil, or those in which every atom is free to move independent lyUEl. 
The first has the advantage of locking out many of the degrees of freedom of the peptide 
chain. It has the disadvantage of potentially missing lower-energy conformations which could 
be accommodated if the secondary elements were flexible. The second approach, allowing 
every atom to move independently of the others, has the advantage that each atom is in 
principal allowed to find its equilibrium position within the fold. It has the great disadvan- 
tage of allowing all possible degrees of freedom, which greatly increases the complexity. A 
model which fits somewhere in between the two extremes, allowing only a few important 
internal degrees of freedom, would be advantageous in many cases. 

The dominant low lying normal modes of a helix can easily be incorporated into models 
of protein structure that currently use rigid helical segments. Each mode has an effective 
spring constant K q , and eigenvector v q = (x q> i, x q> 2, ■ ■ ■ , x^l), which can be obtained by 
diagonalizing the spring matrix @. The energy cost (in ksT*) for exciting these internal 
degrees of freedom is 

E = J2 \k q a\ (11) 

This prescription gives a simple way to include the internal degrees of freedom, along with 
the appropriate energy term, into models of protein structure. For shorter helices, only the 
two bend and twist modes need be incorporated. For longer helices that might supercoil, 
including higher order bend harmonics would be required. But nevertheless, describing the 
possible deformations of a helix can be described by adding relatively few extra degrees of 
freedom. 

In summary, we have shown that a-helices have three prominent flexible modes: two 
bend and one twist. The principal modes obtained from static structures in the Protein 
Data Bank agree extremely well with the dynamical normal modes of a simple spring model 
of a helix. Moreover, the static a-helices from the database have independent Gaussian 
distributions of mode amplitudes, consistent with a quasi-thermal equilibrium. Use of these 
dominant "soft modes" may provide an intermediate path between rigid secondary-structures 
and independent all-atom models for protein structure refinement and design. 
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V. METHODS 



To compile a set of protein structures containing a-helices, we selected one represen- 
tative of each fold in the a, a + /3, and a/j3 families from SCOP release 1.550, yielding 
a total of 399 protein structures. Each of these structures was then decomposed into its 
{0, ip, Q} angle sequence, and backbone bond lengths. All backbone atom coordinates could 
be reconstructed from this data. 

Sets of a-helices of given length were extracted from the above structure set as follows: 
We identified a-helices by unbroken series of dihedral angles within a square region {0, ip} = 
{—50 ± 30, —50 ± 30}. For example, a sequence of 15 {4>,ip} angles all falling within the 
defined region would be added to our helix set of 15mers. This same sequence would also 
contribute two Miners, three 13mers, four 12mers, and so on, to the data sets of these other 
lengths. For a given helix length, we scanned all 399 structures, and extracted the a-helical 
fragments. This yielded our representative set of a-helices for lengths L = 10 — 25. 

The {(ft, ip,uj} angles for each of the 399 protein structures from SCOP were calculated 
using the freely available program StrideS 

The eigenvalues and eigenvectors of both the covariance matrix and the spring matrix 
were computed using the eigenvalue solver for real symmetric matrices in the NAG numerical 
library. The elements making up the spring matrix, Eq. (^), were evaluated by computing 
the second derivative of Eq. (0) numerically. 
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TABLES 



PDB ID 


Residues 


L 


Bend 


Bend^ 2 ) 


Bendt 3 ) 


Twist 


Total 


2ZTA 


A 2-29 


28 


70.70 


2.41 


4.66 


7.54 


78.24* 




B 2-29 


28 


75.97 


2.05 


1.00 


13.56 


89.53* 


1QEY 


A 55-80 


26 


77.23 


2.00 


0.00 


7.78 


85.01* 




C 55-80 


26 


77.23 


2.00 


0.00 


7.79 


85.02* 




B 55-79 


25 


89.33 


0.00 


0.00 


4.49 


93.82* 




D 55-79 


25 


89.33 


0.00 


0.00 


4.49 


93.82* 


1M1J 


A 83-160 


78 


61.41 


16.54 


2.80 


11.04 


91.99** 




B 86-163 


78 


0.50 


68.09 


1.00 


16.66 


85.25** 



* Total is sum of Bend and Twist 

** Total is sum of Bend, Bend^ 2 ^ and Twist 



TABLE I. Results of projecting coiled-coil helices onto normal-modes of spring model. Columns 
denote the percentage of the helical displacement accounted for by the specified mode. Bend is the 
sum of the percentages for the two lowest-energy bend modes. Bend^ 2 ^ and Bend^ 3 ) correspond, 
respectively, to percentages for the 2nd and 3rd harmonics of bend. Twist is the percentage for 
the twist normal mode. L is the length of the coiled-coil helix. 
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FIGURE CAPTIONS 



1. Representative set of 47 aligned 18-mer helices. 

2. (a) The ten largest eigenvalues from the principal-component analysis of 18-residue a- 
helices from the representative data set. (b) The scaling of the three largest principal- 
component eigenvalues as a function of helix length L, i.e. number of residues. The 
bend modes are fit to the scaling form A ben d = (k B T* / k)L a yielding k B T* / ' k = 1.378 x 
1CT 5 A~ 2 . The twist mode is fit to the scaling form A tw i St = (k B T*/c)L 2 yielding 
k B T*/c = 0.0022. 

3. (a) Exaggerated bend mode of a helix. (Average structure in blue, bent structure in 
green), (b) Exaggerated twist mode of a helix (Average structure in blue, twisted 
structure in green). The helices are 18 residues long. 

4. (a,b,c) Histograms of projections onto the two bend modes and one twist mode ob- 
tained from the principal-component analysis. Data is shown for the 1182 18-residue 
a-helices from the representative data set. Solid lines correspond to Gaussian fits to 
the data. The fitted variances are 1.55 A 2 , 1.53 A 2 , and 0.71 A 2 , respectively, for 
the two bend modes and one twist mode. (d,e,f) Projections onto two-dimensional 
subspaces spanned by the two bend modes and one twist mode, for the same 1182 
18-residue a-helices. The results are consistent with uncorrected modes. 

5. (a) The three largest principal-component eigenvalues for helices of length L = 10 — 
25 (discrete data points) compared to the inverse spring constants for the normal 
modes obtained from the spring model (continuous curves). To obtain this fit, we 
used real-space spring constants K 2 , 3 = 20k B T*/A 2 and K 4 = 7k B T*/A 2 . (b) Plot of 
eigenvectors for the two bend and twist modes for L = 18 (graphs from top to bottom 
correspond to bend 1, bend 2, and twist). There are three coordinates for each C a 
position, thus the length of each eigenvector is 3 x 18 = 54. Shown is the overlap of 
eigenvectors from the principal-component analysis (filled circle = bend 1, open square 
= bend 2, filled diamond = twist) with those from the spring model (curves). The 
curves are not fits to the PCA eigenvector data; the curves are the eigenvectors from 
the spring model with the same spring constants used in panel (a). 

6. (a) Helices making up coiled-coil in the leucine zipper (2ZTA). (b) Tetramerization 
domain of Mnt repressor (1QEY). (c) Coiled-coil fragment from chicken fibrinogen 
(1M1J). 



13 



FIGURES 




Emberly et al. Fig. 1 
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